Price Doc Densities

Price_doc

Log price doc

Floor

Product Type vs Price_doc

### Product TYpe vs log(price_doc)

Factor Variable plots

thermal_power_plant_raion

##     no   yes
##             
##  28817  1654
##          no        yes
##                       
##  0.94571888 0.05428112

incineration_raion

oil_chemistry_raion

radiation_raion

railroad_terminal_raion

Little data preprocessing

Timestamp Plots

Simple statistics

### Frequency by Timestamp

Mean Price

Median Price

Extract month from timestamp

simple statistics , grouped by month

Data Table stats

Mean Price

All at once

Bocplot with geom_jitter

###

Highcharter Plots

loading time consuming :(

Highcharter

Data Table

Highcharter, few stats on one plot

Sub Area tree map

Count by sub_area

## 
Read 98.5% of 30471 rows
Read 30471 rows and 292 (of 292) columns from 0.043 GB file in 00:00:03

Tree map plot

Interactive Tree map plot

20170502 - end.

Differences in test dataset

Sub Area tree map TEST SET

Count by sub_area

Tree map plot - test dataset

Interactive Tree map plot

Simple statistics

### Frequency by Timestamp

Update 20170505 does not make sense to look deeper in test dataset. Todo: independent kernel with lasso, l1, penalized and trees. end.